SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Caras , Ingrid W 
(ii) TITLE OF INVENTION: A2-1 Neurotrophic Factor 
(iii) NUMBER OF SEQUENCES : 10 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/635130 

(B) FILING DATE: 19-Mar-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Torchia, PhD., Timothy E. 
-(B) REGISTRATION NUMBER: 36,700 
(C) REFERENCE /DOCKET NUMBER: P1001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-8674 

(B) TELEFAX: 650/952-9881 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1877 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ix) FEATURE: 

(A) NAME/KEY: Extra Cellular Domain 

(B) LOCATION: 244-899 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
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(ix) FEATURE: 

(A) NAME /KEY : Transmembrane Domain 

(B) LOCATION: 901-978 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: signal peptide 

(B) LOCATION: 244-321 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC GACGGCCCCT 5 0 

GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC GCGGGCACAG 100 

CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC CGGGGTGGTC 150 

CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCCGGGG GGTGGGCGAC 2 00 

TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTTGGCGGG GTC ATG 24 6 

Met 
1 

GGG CCC CCC CAT TCT GGG CCG GGG GGC GTG CGA GTC GGG 2 85 
Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly 
5 10 

GCC CTG CTG CTG CTG GGG GTT TTG GGG CTG GTG TCT GGG 324 
Ala Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly 
15 20 25 

CTC AGC CTG GAG CCT GTC TAC TGG AAC TCG GCG AAT AAG 3 63 
Leu Ser Leu Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys 
30 35 40 

AGG TTC CAG GCA GAG GGT GGT TAT GTG CTG TAC CCT CAG 4 02 
Arg Phe Gin Ala Glu Gly Gly Tyr Val Leu Tyr Pro Gin 

45 50 

ATC GGG GAC CGG CTA GAC CTG CTC TGC CCC CGG GCC CGG 441 
lie Gly Asp Arg Leu Asp Leu Leu Cys Pro Arg Ala Arg 
55 60 65 

CCT CCT GGC CCT CAC TCC TCT CCT AAT TAT GAG TTC TAC 480 
Pro Pro Gly Pro His Ser Ser Pro Asn Tyr Glu Phe Tyr 
70 75 
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AAG CTG TAC CTG GTA GGG GGT GCT CAG GGC CGG CGC TGT 519 
Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg Arg Cys 
80 85 90 

GAG GCA CCC CCT GCC CCA AAC CTC CTT CTC ACT TGT GAT 558 
Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 
95 100 105 

CGC CCA GAC CTG GAT CTC CGC TTC ACC ATC AAG TTC CAG 597 
Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin 

110 115 

GAG TAT AGC CCT AAT CTC TGG GGC CAC GAG TTC CGC TCG 63 6 
Glu Tyr Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser 
120 125 130 

CAC CAC GAT TAC TAC ATC ATT GCC AC A TCG GAT GGG ACC 675 
His His Asp Tyr Tyr He He Ala Thr Ser Asp Gly Thr 
135 140 

CGG GAG GGC CTG GAG AGC CTG CAG GGA GGT GTG TGC CTA 714 
Arg Glu Gly Leu Glu Ser Leu Gin Gly Gly Val Cys Leu 
145 150 155 

ACC AGA GGC ATG AAG GTG CTT CTC CGA GTG GGA CAA AGT 753 
Thr Arg Gly Met Lys Val Leu Leu Arg Val Gly Gin Ser 
160 165 170 

CCC CGA GGA GGG GCT GTC CCC CGA AAA CCT GTG TCT GAA 792 
Pro Arg Gly Gly Ala Val Pro Arg Lys Pro Val Ser Glu 

175 180 

ATG CCC ATG GAA AGA GAC CGA GGG GCA GCC CAC AGC CTG 831 
Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser Leu 
185 190 195 

GAG CCT GGG AAG GAG AAC CTG CCA GGT GAC CCC ACC AGC 870 
Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser 
200 205 



AAT GCA ACC TCC CGG GGT GCT GAA GGC CCC CTG CCC CCT 9 09 
Asn Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro 
210 215 220 

CCC AGC ATG CCT GCA GTG GCT GGG GCA GCA GGG GGG CTG 948 
Pro Ser Met Pro Ala Val Ala Gly Ala Ala Gly Gly Leu 
225 230 235 

GCG CTG CTC TTG CTG GGC GTG GCA GGG GCT GGG GGT GCC 987 
Ala Leu Leu Leu Leu Gly Val Ala Gly Ala Gly Gly Ala 
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240 



245 



ATG TGT TGG CGG AGA CGG CGG GCC AAG CCT TCG GAG AGT 102*6 
Met Cys Trp Arg Arg Arg Arg Ala Lys Pro Ser Glu Ser 
250 255 260 

CGC CAC CCT GGT CCT GGC TCC TTC GGG AGG GGA GGG TCT.1065 
Arg His Pro Gly Pro Gly Ser Phe Gly Arg Gly Gly Ser 
265 270 

CTG GGC CTG GGG GGT GGA GGT GGG ATG GGA CCT CGG GAG 1104 
Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro Arg Glu 
275 280 285 

GCT GAG CCT GGG GAG CTA GGG ATA GCT CTG CGG GGT GGC 1143 
Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 
290 295 300 

GGG GCT GCA GAT CCC CCC TTC TGC CCC CAC TAT- GAG AAG 1182 
Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys 

305 310 

GTG AGT GGT GAC TAT GGG CAT CCT GTG TAT ATC GTG CAG 1221 
Val Ser Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin 
315 320 325 

GAT GGG CCC CCC CAG AGC CCT CCA AAC ATC TAC TAC AC A 12 60 
Asp Gly Pro Pro Gin Ser Pro Pro Asn lie Tyr Tyr Thr 
330 335 

TCG ATT TCT GTG TTG GAG TGG CCC ATA TTG CAT ACG ATA 12 9 9 
Ser lie Ser Val Leu Glu Trp Pro lie Leu His Thr lie 
340 345 350 

CAA CTG TTT TTC ATG CGA TCC AAG TGC TCC CGT GTC ACT 13 3 8 
Gin Leu Phe Phe Met Arg Ser Lys Cys Ser Arg Val Thr 
355 360 365 

ACA TTC TTA TTT CCT GTG CAA GTT ATT ACG ACA TCG ACT 1377 
Thr Phe Leu Phe Pro Val Gin Val lie Thr Thr Ser Thr 

370 375 

TGC CGG ATG ACT TCA TTT AGC TTT ACC ACC CTG AAC CCA 1416 
Cys Arg Met Thr Ser Phe Ser Phe Thr Thr Leu Asn Pro 
380 385 390 

TCC ATG CAG GCC TGC AGA GCA CAG ATG GGG GAA TTC CGA 145 5 
Ser Met Gin Ala Cys Arg Ala Gin Met Gly Glu Phe Arg 
395 400 
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ATC AG A TGG TGT TTC TGG GGG GAC AGG ATC CTG GGT ACG 1494 
lie Arg Trp Cys Phe Trp Gly Asp Arg lie Leu Gly Thr 
405 410 415 

GCT CTG TTT GTG CTT GTG CTT ATT CTT CTT CTT GGG AGG 153 3 
Ala Leu Phe Val Leu Val Leu lie Leu Leu Leu Gly Arg 
420 425 430 

CTG AAT ATG CAT CAG ACG ACA CTG CTC CGG CAA CGG GCC 1572 
Leu Asn Met His Gin Thr Thr Leu Leu Arg Gin Arg Ala 

435 440 

AGT GTG GAG GCG GAA GCC GGC CAG CAT GGT CCC CTG TG 1610 
Ser Val Glu Ala Glu Ala Gly Gin His Gly Pro Leu 
445 450 455 

ATAGGATTGA AAGAGCTACT GAGAATAGGG GGCTTCTCAA TGAGAGAGCG 166 0 

GAGGCTGCTG TTATCATGGG AACCAGGCAG ATCAATCATC CCTGGCAGGT 1710 

CAGGCAGGAA GTTACTTAGC TTCTCCTTCA CCTTCTTCCC ACAGAATTTA 176 0 

TTATAGGCTT GTTCCAAGTT GTAGTGTGTG ATCAGATTCG TGCTGCCTGT 1810 

CAGCTCTGTG CTACCTGGCA GTTCCCCTCA TGGAATTCGA TATCAAGCTT 1860 

ATCGATACCG TCGACCT 1877 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly Ala 
15 10 15 

Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly Leu Ser Leu 

20 25 30 

Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys Arg Phe Gin Ala Glu 

35 40 45 

Gly Gly Tyr Val Leu Tyr Pro Gin lie Gly Asp Arg Leu Asp Leu 

5 0 55 60 

Leu Cys Pro Arg Ala Arg Pro Pro Gly Pro His Ser Ser Pro Asn 



75 



65 70 75 

Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg 

80 85 90 

Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 

95 100 105 

Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 

110 115 120 

« 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 

125 130 135 

Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leu 

155 160 165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg Lys Pro 

170 175 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser 

185 190 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser Asn 

200 205 210 

Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 220 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 

230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 

245 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 

260 265 270 

Arg Gly Gly Ser Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro 

275 280 285 

Arg Glu Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 295 300 

Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 

305 310 315 
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Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 

320 325 330 



Gin Ser Pro Pro Asn lie Tyr Tyr Thr Ser lie Ser Val Leu Glu 

335 340 345 

Trp Pro lie Leu His Thr lie Gin Leu Phe Phe Met Arg Ser Lys 

350 355 360 

Cys Ser Arg Val Thr Thr Phe Leu Phe Pro Val Gin Val lie Thr 

365 370 375 

Thr Ser Thr Cys Arg Met Thr Ser Phe Ser Phe Thr Thr Leu Asn 

380 385 390 

Pro Ser Met Gin Ala Cys Arg Ala Gin Met Gly Glu Phe Arg lie 

395 400 405 

Arg Trp Cys Phe Trp Gly Asp Arg lie Leu Gly Thr Ala Leu Phe 

410 415 420 

Val Leu Val Leu lie Leu Leu Leu Gly Arg Leu Asn Met His Gin 

425 430 435 

Thr Thr Leu Leu Arg Gin Arg Ala Ser Val Glu Ala Glu Ala Gly 

440 445 450 

Gin His Gly Pro Leu 

455 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 80 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC GACGGCCCCT 5 0 

GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC GCGGGCACAG 100 

CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC CGGGGTGGTC 150 

CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCCGGGG GGTGGGCGAC 2 00 

TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTTGGCGGG GTCATGGGGC 250 
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CCCCCCATTC TGGGCCGGGG GGCGTGCGAG TCGGGGCCCT GCTGCTGCTG 300 

GGGGTTTTGG GGCTGGTGTC TGGGCTCAGC CTGGAGCCTG TCTACTGGAA 3 50 

CTCGGCGAAT AAGAGGTTCC AGGCAGAGGG TGGTTATGTG CTGTACCCTC 400 

AGATCGGGGA CCGGCTAGAC CTGCTCTGCC CCCGGGCCCG GCCTCCTGGC 450 

CCTCACTCCT CTCCTAATTA TGAGTTCTAC AAGCTGTACC TGGTAGGGGG 500 

TGCTCAGGGC CGGCGCTGTG AGGCACCCCC TGCCCCAAAC CTCCTTCTCA 550 

CTTGTGATCG CCCAGACCTG GATCTCCGCT TCACCATCAA GTTCCAGGAG 600 

TATAGCCCTA ATCTCTGGGG CCACGAGTTC CGCTCGCACC ACGATTACTA 650 

CATCATTGCC ACATCGGATG GGACCCGGGA GGGCCTGGAG AGCCTGCAGG 7 00 

GAGGTGTGTG CCTAACCAGA GGCATGAAGG TGCTTCTCCG AGTGGGACAA 7 50 

AGTCCCCGAG GAGGGGCTGT CCCCCGAAAA CCTGTGTCTG AAATGCCCAT 800 

GGAAAGAGAC CGAGGGGCAG CCCACAGCCT GGAGCCTGGG AAGGAGAACC 850 

TGCCAGGTGA CCCCACCAGC AATGCAACCT CCCGGGGTGC TGAAGGCCCC 900 

CTGCCCCCTC CCAGCATGCC TGCAGTGGCT GGGGCAGCAG GGGGGCTGGC 950 

GCTGCTCTTG CTGGGCGTGG CAGGGGCTGG GGGTGCCATG TGTTGGCGGA 1000 

GACGGCGGGC CAAGCCTTCG GAGAGTCGCC ACCCTGGTCC TGGCTCCTTC 1050 

GGGAGGGGAG GGTCTCTGGG CCTGGGGGGT GGAGGTGGGA TGGGACCTCG 1100 

GGAGGCTGAG CCTGGGGAGC TAGGGATAGC TCTGCGGGGT GGCGGGGCTG 1150 

CAGATCCCCC CTTCTGCCCC CACTATGAGA AGGTGAGTGG TGACTATGGG 12 0 0 

CATCCTGTGT ATATCGTGCA GGATGGGCCC CCCCAGAGCC CTCCAAACAT 12 50 

CTACTACAAG GTATGAGGGC TCCTCTCACG TGGCTATCCT GAATCCAGCC 13 00 

CTTCTTGGGG TGCTCCTCCA GTTTAATTCC TGGTTTGAGG GACACCTCTA 13 50 

ACATCTCGGC CCCCTGTGCC CCCCCAGCCC CTTCACTCCT CCCGGCTGCT 140 0 

GTCCTCGTCT CCACTTTTAG GATTCCTTAG GATTCCCACT GCCCCACTTC 1450 

CTGCCCTCCC GTTTGGCCAT GGGTGCCCCC CTCTGTCTCA GTGTCCCTGG 15 00 
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ATCCTTTTTC CTTGGGGAGG GGCACAGGCT CAGCCTCCTC TCTGACCATG 1550 
ACCCAGGCAT CCTTGTCCCC CTCACCCACC CAGAGCTAGG GGCGGGAACA 1600 
GCCCACCTTT TGGTTGGCAC CGCCTTCTTT CTGCCTCTCA CTGGTTTTCT 1650 
CTTCTCTATC TCTTATTCTT TCCCTCTCTT CCGTCTCTAG GTCTGTTCTT 17 0 0 
CTTCCCTAGC ATCCTCCTCC CCACATCTCC TTTCACCCTC TTGGCTTCTT 1750 
ATCCTGTGCC TCTCCCATCT CCTGGGTGGG GGCATCAAAG CATTTCTCCC 1800 
CTTAGCTTTC AGCCCCCCTT CTGACCTCTC ATACCAACCA CTCCCCTCAG 1850 
TCTGCCAAAA ATGGGGGCCT TATGGGGAAG GCTCTGACAC TCCACCCCAG 1900 
CTCAGGCCAT GGGCAGCAGG GCTCCATTCT CTGGCCTGGC CCAGGCCTCT 195 0 
ACATACTTAC TCCAGCCATT TGGGGTGGTT GGGTCATGAC AGCTACCATG 2 00 0 
AGAAGAAGTG TCCCGTTTTG TCCAGTGGCC AATAGCAAGA TATGAACCGG 2 050 
TCGGGACATG TATGGACTTG GTCTGATGCT GAATGGGCCA CTTGGGACCG 2100 
GAAGTGACTT GCTCCAGACA AGAGGTGACC AGGCCCGGAC AGAAATGGCC 2150 
TGGGAAGTAG CAGAAGCAGT GCAGCAGGAA CTGGAAGTGC CTTCATCCAG 22 00 
GACAGGAAGT AGCACTTCTG AAACAGGAAG TGGTCTGGCT GGAACTCCAA 22 50 
GTGGCTTAGT CTGGGGGATC AGGAGGTGGG AGGTGGATGG TTCTTATTCT 23 00 
GTGGAGAAGA AGGGCGGGAA GAACTTCCTT TCAGGAGGAA GCTGGAACTT 2 3 50 
ACTGACTGTA AGAGGTTAGA GGTGGACCGA 23 80 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly Ala 
15 10 ' 15 

Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly Leu Ser Leu 

20 25 30 

79 



Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys Arg Phe Gin Ala Glu 

35 40 45 



Gly Gly Tyr Val Leu Tyr Pro Gin lie Gly Asp Arg Leu Asp Leu 

50 55 60 

Leu Cys Pro Arg Ala Arg Pro Pro Gly Pro His Ser Ser Pro Asn 

65 70 75 

Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg 

80 85 90 

Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 

95 100 105 

Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 

110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 

125 130 135 

Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leu 

155 160 165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg Lys Pro 

170 175 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser 

185 190 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser Asn 

200 205 210 

Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 220 . 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 

230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 

245 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 

260 265 270 

Arg Gly Gly Ser Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro 

275 280 " 285 

80 



Arg Glu Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 295 300 



Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 

305 310 315 

Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 

320 325 330 

Gin Ser Pro Pro Asn lie Tyr Tyr Lys Val 

335 340 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



GCCGACGCTG TGAGGCACCC CCTGCCCCAA ACCTCCTTCT CACTTGTGAT 50 

CGCCCAGACC TGGATCTCCG CTTCACCATC AAGTTCCAGG AGTATAGCCC 100 

TAATCTCTGG GGCCACGAGT TCCGCTCGCA CCACGATTAC TACATCATTG 15 0 

CCACATCGGA TGGGACCCGG GAGGCCTGGG AGAGCCTGCA GGGAAGTGTG 200 

TGCCTAACCA GAGGCATGAA GGTGCTTCTC CGAGTNGGAC AAAGTCCCGA 250 

GGAGGGGCTG TCCCCCGAAA ACCTGTGTCT GAAATGCCCA TGGAAAGAGA 3 00 

CCGAGGGGCA GCCCACAGCC TGGGAGCCTG GGGAAGGAGA ACCTGCCAGG 3 50 

TGACCCCACC AGCAATNCAA CCTTCCGGGG TTGCTTGAAG GGCCCCTTGA 4 00 

CCCTTTCCCA GCATTGCNTG CANTTGGTTN GGGGCAGCAN GGGGGNGTTT 450 
TGGC 454 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
{ D ) TOPOLOGY : Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Lys Thr His Thr Cys Pro Pro Cys Pro 
15 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GGACAAAGTC CCGAGGAGGG GCTGTCCCCC GAAAACCTGT GTCTGAAATG 50 
CCCATGGAAA 60 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



CAGGTTCTCC TTCCCCAGGC TCCCAGGCTG TGGGCTGCCC CTCGGTCTCT 50 
TTCCATGGGC 60 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Ala Arg Pro Gly Gin Arg Trp Leu Gly Lys Trp Leu Val Ala 
15 10 15 

Met Val Val Trp Ala Leu Cys Arg Leu Ala Thr Pro Leu Ala Lys 

20 25 30 
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Asn Leu Glu Pro Val Ser Trp Ser Ser Leu Asn Pro Lys Phe Leu 

35 40 45 



Ser Gly Lys Gly Leu Val lie Tyr Pro Lys lie Gly Asp Lys Leu 

50 55 60 

Asp lie lie Cys Pro Arg Ala Glu Ala Gly Arg Pro Tyr Glu Tyr 

65 70 75 

Tyr Lys Leu Tyr Leu Val Arg Pro Glu Gin Ala Ala Ala Cys Ser 

80 85 90 

Thr Val Leu Asp Pro Asn Val Leu Val Thr Cys Asn Arg Pro Glu 

95 100 105 

Gin Glu lie Arg Phe Thr lie Lys Phe Gin Glu Phe Ser Pro Asn 

110 115 120 

Tyr Met Gly Leu Glu Phe Lys Lys His His Asp Tyr Tyr lie Thr 

125 130 135 

Ser Thr Ser Asn Gly Ser Leu Glu Gly Leu Glu Asn Arg Glu Gly 

140 145 150 

Gly Val Cys Arg Thr Arg Thr Met Lys lie lie Met Lys Val Gly 

155 160 165 

Gin Asp Pro Asn Ala Val Thr Pro Glu Gin Leu Thr Thr Ser Arg 

170 175 180 

Pro Ser Lys Glu Ala Asp Asn Thr Val Lys Met Ala Thr Gin Ala 

185 190 195 

Pro Gly Ser Arg Gly Ser Leu Gly Asp Ser Asp Gly Lys His Glu 

200 205 210 

Thr Val Asn Gin Glu Glu Lys Ser Gly Pro Gly Ala Ser Gly Gly 

215 220 225 

Ser Ser Gly Asp Pro Asp Gly Phe Phe Asn Ser Lys Val Ala Leu 

230 235 240 

Phe Ala Ala Val Gly Ala Gly Cys Val He Phe Leu Leu He He 

245 250 255 

He Phe Leu Thr Val Leu Leu Leu Lys Leu Arg Lys Arg His Arg 

260 265 270 

Lys His Thr Gin Gin Arg Ala Ala Ala Leu Ser Leu Ser Thr Leu 

275 280 285 
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Ala Ser Pro Lys Gly Gly Ser Gly Thr Ala Gly Thr Glu Pro Ser 

290 295 300 



Asp lie lie lie Pro Leu Arg Thr Thr Glu Asn Asn Tyr Cys Pro 

305 310 315 

His Tyr Glu Lys Val Ser Gly Asp Tyr Gly His Pro Val Tyr lie 

320 325 330 

Val Gin Glu Met Pro Pro Gin Ser Pro Ala Asn lie Tyr Tyr Lys 

335 340 345 

Val 
346 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Val Arg Arg Asp Ser Val Trp Lys Tyr Cys Trp Gly Val 

15 10 15 

Leu Met Val Leu Cys Arg Thr Ala lie Ser Lys Ser lie Val Leu 

20 25 30 

Glu Pro lie Tyr Trp Asn Ser Ser Asn Ser Lys Phe Leu Pro Gly 

35 40 45 

Gin Gly Leu Val Leu Tyr Pro Gin lie Gly Asp Lys Leu Asp lie 

50 55 60 

lie Cys Pro Lys Val Asp Ser Lys Thr Val Gly Gin Tyr Glu Tyr 

65 70 75 

Tyr Lys Val Tyr Met Val Asp Lys Asp Gin Ala Asp Arg Cys Thr 

80 85 *" 90 

lie Lys Lys Glu Asn Thr Pro Leu Leu Asn Cys Ala Lys Pro Asp 

95 100 105 

Gin Asp lie Lys Phe Thr lie Lys Phe Gin Glu Phe Ser Pro Asn 

110 115 120 

Leu Trp Gly Leu Glu Phe Gin Lys Asn Lys Asp Tyr Tyr lie lie 

125 130 " " 135 
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Ser Thr Ser Asn Gly Ser Leu Glu Gly Leu Asp Asn Gin Glu Gly 

140 145 ' 150 

Gly Val Cys Gin Thr Arg Ala Met Lys He Leu Met Lys Val Gly 

155 160 " 165 

Gin Asp Ala Ser Ser Ala Gly Ser Thr Arg Asn Lys Asp Pro Thr 

170 175 ~ 180 

Arg Arg Pro Glu Leu Glu Ala Gly Thr Asn Gly Arg Ser Ser Thr 

185 190 195 

Thr Ser Pro Phe Val Lys Pro Asn Pro Gly Ser Ser Thr Asp Gly 

200 205 210 

Asn Ser Ala Gly His Ser Gly Asn Asn He Leu Gly Ser Glu Val 

215 220 ' 225 

Ala Leu Phe Ala Gly He Ala Ser Gly Cys He He Phe He Val 

230 235 240 

He He He Thr Leu Val Val Leu Leu Leu Lys Tyr Arg Arg Arg 

245 250 * 255 

His Arg Lys His Ser Pro Gin His Thr Thr Thr Leu Ser Leu Ser 

260 265 270 

Thr Leu Ala Thr Pro Lys Arg Ser Gly Asn Asn Asn Gly Ser Glu 

275 280 ' 285 

Pro Ser Asp He He He Pro Leu Arg Thr Ala Asp Ser Val Phe 

290 295 300 

Cys Pro His Tyr Glu Lys Val Ser Gly Asp Tyr Gly His Pro Val 

305 310 " 315 

Tyr He Val Gin Glu Met Pro Pro Gin Ser Pro Ala Asn He Tyr 

320 325 330 

Tyr Lys Val 
333 
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